Full-text citation analysis: A new method to enhance scholarly networks

نویسندگان

  • Xiaozhong Liu
  • Jinsong Zhang
  • Chun Guo
چکیده

In this article, we use innovative full-text citation analysis along with supervised topic modeling and networkanalysis algorithms to enhance classical bibliometric analysis and publication/author/venue ranking. By utilizing citation contexts extracted from a large number of full-text publications, each citation or publication is represented by a probability distribution over a set of predefined topics, where each topic is labeled by an author-contributed keyword. We then used publication/ citation topic distribution to generate a citation graph with vertex prior and edge transitioning probability distributions. The publication importance score for each given topic is calculated by PageRank with edge and vertex prior distributions. To evaluate this work, we sampled 104 topics (labeled with keywords) in review papers. The cited publications of each review paper are assumed to be “important publications” for the target topic (keyword), and we use these cited publications to validate our topic-ranking result and to compare different publication-ranking lists. Evaluation results show that full-text citation and publication content prior topic distribution, along with the classical PageRank algorithm can significantly enhance bibliometric analysis and scientific publication ranking performance, comparing with term frequency–inverted document frequency (tf–idf), language model, BM25, PageRank, and PageRank + language model (p < .001), for academic information retrieval (IR) systems. Introduction and Motivation

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Citation Recommendation via Proximity Full-Text Citation Analysis and Supervised Topical Prior

Currently the many publications are now available electronically and online, which has had a significant effect, while brought several challenges. With the objective to enhance citation recommendation based on innovative text and graph mining algorithms along with full-text citation analysis, we utilized proximitybased citation contexts extracted from a large number of full-text publications, a...

متن کامل

Computing Interdisciplinarity of Scholarly Objects using an Author-Citation-Text Model

There has been a growing need to determine if research proposals and results are truly interdisciplinary or to analyze research trends by analyzing research papers, reports, proposals and even researchers. In this paper, we tackle the problem and propose a method for measuring interdisciplinarity of scholarly objects. The newly proposed model takes into account authors, citations, and text cont...

متن کامل

Investigation on Full-Text Databases Cited in LIS

Background and Aim: The main objective of this research was to investigate the use of full-text databases in the LIS theses of Tehran State Universities within the years 2005 and 2009. Method: For this purpose, the total of 9952 citations related to 172 existing theses in the academic central libraries were studied. The data collected were analyzed by the bibliometrics and citation analysis met...

متن کامل

Analysis of Citation Verbs in EFL Academic Writing: The Case Study of Dissertations and Theses at the University of Dar es Salaam, Tanzania

This study was an analytical account of EFL postgraduate learners’ use of verbs in citing other scholars in their own writing. Particular interest was differing extents of these verbs as categorised by Myer (1997), namely verbs representing statement of scholarly writing, verbs communicating knowledge of scholarly writing, and verbs denoting cognition of scholarly writing, each of which has sub...

متن کامل

Recovering uncaptured citations in a scholarly network: A two-step citation analysis to estimate publication importance

The citation relationships between publications, which are significant for assessing the importance of scholarly components within a network, have been used for various scientific applications. Missing citation metadata in scholarly databases, however, create problems for classical citation-based ranking algorithms and challenge the performance of citation-based retrieval systems. In this resea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JASIST

دوره 64  شماره 

صفحات  -

تاریخ انتشار 2013